PHOG: Probabilistic Model for Code

Authors

  • Pavol Bielik
  • Veselin Raychev
  • Martin T. Vechev
Abstract

We introduce a new generative model for code called probabilistic higher order grammar (PHOG). PHOG generalizes probabilistic context free grammars (PCFGs) by allowing conditioning of a production rule beyond the parent non-terminal, thus capturing rich contexts relevant to programs. Even though PHOG is more powerful than a PCFG, it can be learned from data just as efficiently. We trained a PHOG model on a large JavaScript code corpus and show that it is more precise than existing models, while similarly fast. As a result, PHOG can immediately benefit existing programming tools based on probabilistic models of code.
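
The difference between conditioning on the parent non-terminal alone and conditioning on a richer context can be illustrated with a small count-based sketch. This is not the authors' implementation: the observations, the `estimate` helper, and the choice of context (the object a property is accessed on) are all hypothetical, and the context is supplied by hand here, whereas the abstract does not say how PHOG obtains it.

```python
from collections import Counter, defaultdict

# Toy observations: (parent non-terminal, context, chosen production).
# All values are invented for illustration; they are not from the paper.
OBSERVATIONS = [
    ("Property", "console", "log"),
    ("Property", "console", "log"),
    ("Property", "console", "error"),
    ("Property", "Math", "floor"),
    ("Property", "Math", "floor"),
    ("Property", "Math", "max"),
]


def estimate(observations, key):
    """Maximum-likelihood estimate of P(production | key(parent, context))."""
    counts = defaultdict(Counter)
    for parent, context, production in observations:
        counts[key(parent, context)][production] += 1
    return {
        cond: {prod: n / sum(c.values()) for prod, n in c.items()}
        for cond, c in counts.items()
    }


# PCFG-style: the production depends only on the parent non-terminal.
pcfg = estimate(OBSERVATIONS, lambda parent, context: parent)

# Context-conditioned (PHOG-like in spirit): the production depends on the
# parent non-terminal and an additional piece of context.
phog_like = estimate(OBSERVATIONS, lambda parent, context: (parent, context))

print(pcfg["Property"])
print(phog_like[("Property", "console")])
```

Both tables are filled by the same single counting pass over the data, which matches the abstract's point that the richer conditioning need not make learning more expensive. In this toy data the parent-only model spreads probability over all four property names, while the context-conditioned model concentrates it on the names seen with `console`.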

Similar resources

Localization of the Lumbar Discs Using Machine Learning and Exact Probabilistic Inference

We propose a novel fully automatic approach to localize the lumbar intervertebral discs in MR images with a PHOG-based SVM and a probabilistic graphical model. At the local level, our method assigns a score to each pixel in the target image that indicates whether it is a disc center or not. At the global level, we define a chain-like graphical model that represents the lumbar intervertebral discs and...

Recognition of Handwritten Mathematical Symbols with PHOG Features

Converting handwritten formulas to LaTeX is a challenging machine learning problem. An essential step in the recognition of mathematical formulas is symbol recognition. In this paper we show that pyramids of oriented gradients (PHOG) are effective features for recognizing mathematical symbols. Our best results are obtained using PHOG features along with a one-against-one SVM classifier. We t...

Berkeley PHOG: PhyloFacts orthology group prediction web server

Ortholog detection is essential in functional annotation of genomes, with applications to phylogenetic tree construction, prediction of protein-protein interaction and other bioinformatics tasks. We present here the PHOG web server employing a novel algorithm to identify orthologs based on phylogenetic analysis. Results on a benchmark dataset from the TreeFam-A manually curated orthology databa...

New Robust Descriptor for Image Matching

Nowadays, object recognition based on feature extraction is widely used in image matching due to its robustness to different types of image transformations. This paper introduces a new approach for extracting invariant features from interest regions. This approach is inspired by the well-known Scale Invariant Feature Transform (SIFT) interest point detector and aims to improve the computatio...

A Comparison of Seismic Safety of Steel MRF Designed According to Different Editions of Iranian Seismic Code

In this paper, we compare the safety of steel moment resistant frames designed according to different editions of the Iranian Code of Practice for Seismic Resistant Design of Buildings. We also evaluate the failure risk of low- and medium-height frames designed for high- and low-seismicity regions according to three editions of the code. First, the testing cases were design...

Publication date: 2016